Creating speaker-specific phonetic templates with a speaker-independent phonetic recognizer: implications for voice dialing

نویسندگان

  • Neena Jain
  • Ronald A. Cole
  • Etienne Barnard
چکیده

We present a new approach to speaker dependent template generation which uses dramatically less storage to represent a speaker's words, with minimal degradation in recognition accuracy. In this approach, the symbolic string produced by a speaker-independent phonetic recognizer is used to represent utterances. We investigate eeective procedures for template generation, and compare the results of these procedures to templates represented by acoustic parameters for utterances produced with diierent telephone handsets. The use of speaker-speciic templates led to a reduction of about 1:500 in data-storage requirements with comparable recognition accuracy. In also compare recognition performance for speaker-speciic and speaker-independent templates , and for combinations of the two. The results showed that combining speaker-speciic and speaker-independent templates produces better recognition performance than either alone. A voice dialing system is described which incorporates the speaker-speciic templates.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Phonetic, idiolectal and acoustic speaker recognition

This paper describes a text-independent speaker recognition system that achieves an equal error rate of less than 1% by combining phonetic, idiolect, and acoustic features. The phonetic system is a novel language-independent speakerrecognition system based on differences among speakers in dynamic realization of phonetic features (i.e., pronunciation), rather than spectral differences in voice q...

متن کامل

Techniques for robust speech recognition in the car environment

The use of voice commands or navigation features in the car is becoming a necessity. As keyboard and display interfaces cannot be used safely while driving, much effort has been done to make automatic speech recognition (ASR) and Text-to-Speech synthesis (TTS) ubiquitous features in the car. From voice dialing to car navigation, the requirements for voice technology vary greatly. While the use ...

متن کامل

Speaker-dependent Speech Recognition Based on Phone-like Units Models | Application to Voice Dialing

This paper presents a speaker dependent speech recognition with application to voice dialing. This work has been developed under the constraints imposed by voice dialing applications, i.e., low memory requirements and limited training material. Two methods for producing speaker dependent word baseforms based on Phone Like Units (PLU) are presented and compared : (1) a classical vector quantizer...

متن کامل

Voice morphing and the manipulation of intra-speaker and cross-speaker phonetic variation to create foreign accent continua: a perceptual study

The STRAIGHT system of voice morphing was used to create voice continua of (Korean) accented Australian English, intended to simulate phonetic variation ranging from ‘heavily accented’ to ‘unaccented’ (native-like) Australian English, employing dimensions of intra-speaker and cross-speaker variation to yield a range of synthetic voices. These synthetic voices were evaluated against actual sampl...

متن کامل

Speaker-dependent speech recognition based on phone-like units models-application to voice dialling

This paper presents a speaker dependent speech recognition with application to voice dialing This work has been devel oped under the constraints imposed by voice dialing appli cations i e low memory requirements and limited training material Two methods for producing speaker dependent word baseforms based on Phone Like Units PLU are pre sented and compared a classical vector quantizer is used t...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1996